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Abstract 

Eukaryotes tune the transcriptional activity of their genome by altering the nucleosome core particle 
through multiple chemical processes. In particular, replacement of the canonical H2A histone with 
the variants macroH2A and H2A.Z has been shown to affect DNA accessibility and nucleosome 
stability; however, the processes by which this occurs remain poorly understood. Here, we elucidate 
the molecular mechanisms of these variants with an extensive molecular dynamics study of the 
canonical nucleosome along with three variant-containing structures: H2A.Z, macroH2A, and an 
H2A mutant with macroH2A-like LI loops. Simulation results show that variant LI loops play a 
pivotal role in stabilizing DNA binding to the octamer through direct interactions, core structural 
rearrangements, and altered allosteric networks in the nucleosome. All variants influence dynamics; 
however, macroH2A-like systems have the largest effect on energetics. In addition, we provide a 
comprehensive analysis of allosteric networks in the nucleosome and demonstrate that variants 
take advantage of stronger interactions between LI loops to propagate dynamics throughout the 
complex. Furthermore, we show that post-translational modifications are enriched at key locations 
in these networks. Taken together, these results provide new insights into the relationship between 
the structure, dynamics, and function of the nucleosome core particle and chromatin fibers, and 
how they are influenced by chromatin remodelling factors. 

*To whom correspondence should be addressed 
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Introduction 


Eukaryotes package their genetic code in highly ordered chromatin hbers. The fundamental 
unit of these structures is the nucleosome core particle (NCP), a complex of ~147 basepairs 
of DNA that are wrapped around eight histone proteins (Figure 1)T Although they have 
minimal sequence homology, each core histone has a structural motif of an N-terminal tail, 
three a-helices connected by two loops (o(l-Ll-o(2-L2-a3), and a C-terminal tailA’^ In the 
assembled NCP, histones are structurally divided into a (H3-H4)2 tetramer that is positioned 
between two H2A-H2B dimers. The only location of inter-dimer interactions is at the base 
of the NCP which is formed by the H2A LI loops, whereas each dimer has two interfaces 

(DD) and the H3-H4 “four helix bundle.” 

Cells regulate chromatin stability and 
DNA accessibility by changing the biochem¬ 
ical properties of the NCP.^^® One of the 
primary chromatin remodeling mechanisms 
is the replacement of H2A or H3 histones 
with “histone variants.” These variants 
have a similar structure and sequence to the 
canonical histones, however they diverge at 
key locations that affect inter-histone and 
DNA-histone contacts. These differences al¬ 
ter the structure and stability of the NCP 
and are therefore implicated in modulating 
transcriptional activity. For example, the 
H2A variant macroH2A exists in large pop¬ 
ulations in the inactive X chromosome of fe¬ 
males but is sparse in active genes. In 

contrast, the H2A.Z variant has been linked 
to both transcriptional activation and re¬ 
pression and is enhanced in regulatory re¬ 
gions of the genome such as promoters and 
enhancers. 

Histone variants influence chromatin 
through diverse mechanisms and struc¬ 
ture/function relationships. macroH2A is 
unique among variants in that it possesses 
multiple domains, including the histone do¬ 
main, a 38 residue linker sequence, and a 
large “macro-domain.” On its own, the 

histone domain is sufhcient for reducing 
transcriptional activity in vivo and increasing the stability of the nucleosome complex, even 
though the crystal structure of an NCP containing this domain shows that variant incorpo¬ 
ration causes only minor NCP rearrangements. The primary sequence is ~65% identical 
to canonical H2A and differs largely from H2A in two important regions: the LI loops and 
the docking domain. The canonical ^®NYAE^^ LI loop possesses a net negative charge, while 


with the tetramer: the H2A “docking domain” 



Figure 1: (left) The nucleosome core particle 
viewed down the DNA superhelical axis. Co¬ 
ordinates are taken from the hnal snapshot of 
the hrst canonical nucleosome simulation. Color 
code: H3 (red), H4 (blue), H2A (green), H2B 
(purple), and DNA (grey), (right) The struc¬ 
tures of the three LI loop sequences considered in 
this study: canonical (top), macroH2A (middle), 
and H2A.Z (bottom). The canonical loops pos¬ 
sess a net negative charge resulting from Glu41, 
while the macroH2A loops possess a net positive 
charge from Lys40. The LI loops of H2A.Z are 
uncharged, but both macroH2A and H2A.Z loops 
introduce a larger hydrophobic volume than the 
canonical. 
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in contrast the macroH2A LI ^®HPKY'^^ sequence has a net positive charge and an increased 
hydrophobicity. Substitutions of the LI loops in canonical H2A with a macroH2A sequence 
(the “Ll-Mutant”) creates nucleosomes with in vitro stabilities and in vivo enrichments 
that are nearly identical to NCPs containing the complete macroH2A histone domainP^’^^ 
Therefore, the LI loops appear to be pivotal in dictating macroH2A’s abilities to affect intra- 
nucleosomal functions. Meanwhile, changes to the docking domains show little effect on in 
vitro stability, but increase in vivo enrichment. 

The role and mechanisms of the H2A.Z variant remains less well dehned, with some 
experiments showing that H2A.Z increases NCP stability while others have found that it 
destabilizes the system. Similar to macroH2A, a comparison of the H2A.Z and canonical 
containing NCP crystal structures show nearly identical overall conformations with the ex¬ 
ception of two features."^ First, the structure of the LI loops is altered, resulting in increased 
contacts between the two H2A/H2B dimers which likely helps to stabilize the histone oc- 
tamer. Second, H2A.Z has fewer hydrogen bonds between the docking domain and H3, which 
could destabilize the dimer/tetramer interface. This combination of stabilizing one area of 
the NCP while destabilizing another may account for the disparate experimental results and 
the multiple functions H2A.Z appears to have.^^ 

Experiments have revealed a wealth of information about how histone variants affect NCP 
and chromatin function, yet several questions still remain. For example: how do seemingly 
minor structural rearrangments affect the stability of the nucleosome? To what extent do 
changes in the LI loops propagate through the complex? Do variants influence NCP function 
through only structural means, or do they take advantage of altered dynamics as well? To 
address these problems, we have performed extensive molecular dynamics (MD) simulations 
of four complete NCP systems that include: 1) canonical H2A histones, 2) the macroH2A 
histone fold domains, 3) “Ll-Mutant” H2A histones, and 4) H2A.Z histones. Our results 
indicate that different sequences in the LI loops perturb the dynamic and energetic properties 
in this region of variant containing NCPs. These effects propagate throughout the complex 
and create subtle, yet important, rearrangements that alter the NCP structure and dynamics 
through both direct effects and modihed allosteric networks. This allows histone variants 
to influence both the global dynamics and energetics of the NCP, and likely contributes to 
large-scale structural changes such as DNA breathing and nucleosome opening, as well as 
inter-NCP interactions in chromatin hbers.^^’^^ 


Materials and Methods 

System and Simulation Details 

Simulations of the canonical, macroH2A, and H2A.Z containing nucleosomes were initial¬ 
ized from their crystal structures (PDB: 1KX5, 1U35, and 1F66, respectively).The Ll- 
Mutant structure was formed using the crystal structure of 1KX5, with the H2A LI loops 
mutated from the canonical ^^NYAE"^^ to the macroH2A ^®HPKY^^ sequence. The systems 
were neutralized and solvated in a 10 A TIP3P box of ISOmM NaCl, creating systems of 
approximately 250,000 atoms. Each system was simulated three times (see supplemental 
material for more details). The simulations were done in the NAMD engine (v2.9) using the 
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AMBER12SB fixed point-charge forcefield.^®’^® Monovalent ions were modeled according to 
Joung and Cheatham. Production simulations were done in the NPT ensemble using stan¬ 
dard techniques. Coordinates were stored every 2 ps. Visualizations were made using 
VMD and PyMOL.32-34 

Allosteric Pathways Calculations 

Allosteric effects were computed with multiple techniques (for specihc details, see sup¬ 
plemental materials). Per-residue differences in dynamics were determined by calculating 
the Kullback-Leibler divergence of dihedral angle populations.^^ For these calculations, the 
canonical populations were used as a reference set. Spurious results were hltered using boot¬ 
strapping techniques. Residue-residue correlations were calculated by utilizing the “largest 
mutual information” method. Residue contacts were determined to be when protein Ca 
or nucleic Cl’ atoms were within 10 A in at least 70% of the conhgurations.^® The mapping 
of allosteric networks was conducted using the Weighted Implementation of Sub-optimal 
Pathways approach. The edge-betweenness centrality of residues in the optimal networks 
were calculated with the NetworkX Python package, with the signihcance determined by a 
hypergeometric distribution (see supplemental materials). 

Interaction Energies 

Interaction energies between the LI loops were calculated using cpptraj.'^^ A cutoff distance 
of 15 A was used for both van der Waals and electrostatic interactions. Because the LI loops 
interact with both protein and DNA, an intermediate dielectric value of 5 was considered 
(sdna = 8, Eprotein = A) Hydrogen bonds were dehned by a separation distance of 3.5 A 
and an angle of 30°. 

DNA binding and complex assembly energies were calculated using the MMPBSA.py 
function of AMBERTOOLS (v.l4).^® The level of theory was restricted to the Generalized 
Born Implicit Solvent (igb=5, radii=mbondi2).^^ Coordinates from every lOOps of production 
simulation were used. The coordinates for the protein constituents were extracted from the 
nucleosome simulations, but the unbound DNA coordinates were taken from a separate 
simulation of 147bp of linear B-form DNA in a 150 mM NaCl environment. Error bars in 
the energies and all other measures are dehned by the standard error of the mean, where the 
number of independent points was determined by the statistical inefficiency of the data set, 
as computed with the PyMBAR package. 


Results 

We performed three independent 250 ns MD simulations for four complete NCP systems: 
1) canonical, 2) macroH2A histone fold, 3) “Ll-Mutant,” and 4) H2A.Z containing NCPs. 
In each set of simulations, approximately 50 ns was required for the root-mean square de¬ 
viations (RMSDs) of the complexes to stabilize (Figures S2-S5) and for the tails to collapse 
from their initial elongated states. These results are consistent with previous MD simula¬ 
tions of the canonical NCP which demonstrated that the overall complex is stable on the 
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hundreds of ns timescale and that the histone tails form strong interactions with the nucle- 
osomal Comparisons between the canonical and variant systems demonstrate 

that variants have both subtle and large-scale effects on the structure and dynamics of the 
LI loops, DNA-histone interactions, and allosteric networks throughout the NCP. 

Altered Dynamics of LI Loops 

Modifications of the LI loop se¬ 
quences in histone variants alter 
their dynamics and energetics. In 
the canonical NCP simulations, an 
average of 0.5 hydrogen bonds were 
formed between the LI loops, pri¬ 
marily between the carboxamide 
nitrogen of asparagine and the car- 
boxylate group of the symmetric 
consistent in the 
but is reduced to 
~0.2 in macroH2A. In these two 
systems, the most prevalent hydro¬ 
gen bonds were formed between the 
phenol oxygen of tyrosine and the 
lone pair of the symmetric histi¬ 
dine. In the H2A.Z simulations, 

Ll-Ll hydrogen bonds were almost 
nonexistent (Table SI). 

Although hydrogen bonds form most frequently in the canonical loops, the net LI loop 
interaction energies are more favorable in the variants (Table 1). The close proximity of 
the negatively charged glutamates in the canonical NCP creates a disfavored electrostatic 
interaction. However, the Ll-Mutant and macroH2A systems avoid an analogous situation 
through Lys-DNA interactions, which separates the like-charges and creates a more favorable 
electrostatic configuration. Meanwhile, the lack of charge in the H2A.Z LI Loops also creates 
a more favorable electrostatic interaction than the canonical NCP. In addition, the LI loop 
rearrangement in the macroH2A and Ll-Mutant systems is further stabilized by van der 

Table 1: Net interaction energies between H2A LI loops show that macroH2A and H2A.Z loops 
have more favorable interactions than canonical LI loops. AU is defined as the difference in energies 
between each variant and the canonical system. Negative values show favorability in the variants. 
All values are reported in kcal/mol. 


System 

Uelect 

Uvdw 

Utot 

^Uelect 

AUvdw 

AUtot 

Canonical NCP 

6.3 ± 2.0 

-9.4 ± 0.8 

-3.1 ± 2.2 

— 

— 

— 

LI Mutant 

-0.9 ± 0.4 

-13.4 ± 0.8 

-14.3 ± 0.9 

-7.2 ± 2.0 

-4.0 ± 0.8 

-11.2 ± 2.4 

macroH2A NCP 

0.4 ± 1.1 

-12.1 ± 0.9 

-11.7 ± 1.0 

-5.9 ± 2.3 

-2.7 ± 0.9 

-8.6 ± 2.3 

H2A.Z NCP 

-0.6 ± 0.1 

-6.9 ± 0.6 

-7.5 ± 0.6 

-6.9 ± 2.0 

2.4 ± 1.0 

-4.4 ± 2.3 


glutamate. This is 
Ll-mutant (~0.4) 
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Figure 2: (a) Distance populations for Lys^*^ to DNA phos¬ 
phate show an interaction that is unique to macroH2A-like 
LI loops, (b) A representative configuration of the Lys 
sidechain stretching across the molecule to interact with 
the dimer’s non-associated DNA. This orientation sterically 
hinders the symmetric loop from forming a similar interac¬ 
tion. This interaction contributes significantly to stabiliz¬ 
ing DNA-octamer binding in the macroH2A and Ll-Mutant 
systems. 
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Waals interactions. In total, the interaction energies of the LI loops in the Ll-Mutant and 
macroH2A structures are substantially favored over those of the canonical nucleosome, with 
respective AUtotai values of -11.2 ± 2.4 kcal/mol and -8.6 ± 2.3 kcal/mol. The H2A.Z LI 
loop conformations are also more favorable than in the canonical system (AUtotai = -4.4 ± 
2.3 kcal/mol). 

The different net charges of the LI loops influence their interactions with the nucleosomal 
DNA. In the canonical LI loops, the negative charge located on Glu41 causes a repulsive 
force to the negatively charged DNA. However, in the macroH2A and Ll-Mutant systems, 
Lys40 introduces a positive charge into the loop which forms a salt bridge with the DNA 
basepair across the axis of symmetry (Figure 2). The lysine forming this salt bridge sterically 
hinders the symmetric lysine residue from doing the same, so the interaction exists in only 
one dimer. The non-interacting lysine is primarily exposed to solvent while intermittently 
forming a hydrogen bond with a neighboring histidine. Since the LI loops of H2A.Z are 
uncharged, they are not capable of forming similar interactions, and therefore did not make 

any direct contacts with the DNA. Taken with the re¬ 
sults of the Ll-Ll loop dynamics, we observe that the 
macroH2A-hke loop sequences stabilize both protein- 
protein and protein-DNA interactions when compared 
to both the canonical and H2A.Z histones. 

Variant Presence Alters Dimer 
Orientations 

Reorganization of the L 1-loops creates perturbations 
that affect the dimer orientations in the NCR. For ex¬ 
ample, the H2A a2 helix extends across the dimer, with 
its N-terminal (the “base”) at the Ll-interface and its 
C-terminal (the “top”) solvent-exposed on the far side 
of the molecule (Figure 3). Simulation analysis showed 
that the canonical system exhibited a separation of 67.1 
± 0.2 A between the tops, and 34.8 ± 0.1 A between the 
bottoms of the H2A a2 helices. In contrast, in each of 
the variant NCPs there is a “bulging” motion in which 
the base separation is increased to ~36.5 A while the 
top separation is decreased to ~66.1 A. Although these 
changes in orientation are only on the order of an A, a 
t-test indicated that they are all extremely statistically 
signihcant (Figure 3). 

This subtle re-orientation of the dimers alters 
histone-DNA hydrogen bonding. For example, the 
guanidine group of H2A Arg29 forms a hydrogen bond 
with the phosphate group of the 23rd basepair of DNA 
in all systems. In the canonical NCP, this bond forms 
in 63% of the conhgurations, whereas in the L 1-mutant 



System 

top 


bottom 

Canonical NCP 

67.1 ± 

.2 

34.8 ± .1 

Ll-Mutant 

66.0 ± 

.2 

36.8 ± .4 

p-value 

.0003 


.0001 

macroH2A NCP 

66.2 ± 

.3 

36.2 ± .1 

p-value 

.0001 


.0001 

H2A.Z NCP 

66.0 ± 

.1 

36.1 ± .1 

p-value 

.0001 


.0001 

Figure 3: 

Separation 

distances 


for H2A a2 helix locations show a 
“bulging” effect in histone variants. 
The helix is displayed in black while 
the helix “top” is highlighted in red 
and the “bottom” in blue. The Ll- 
loops are shown in green for clarity. 
Shifts in mean separation are on the 
order of an A, but the changes in pop¬ 
ulations are all incredibly signihcant. 
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it is formed 70% of the time. In addition, the frequency of hydrogen bonding between 
the 22nd basepair phosphate and the backbone amide of H2B Ser33 increases from 50% in 
the canonical system to 60% in the Ll-Mutant structure. In H2A.Z, the hydrogen bond¬ 
ing at these locations increases drastically to 73% and 85%, respectively. Interestingly, the 
macroH2A nucleosome shows a decreased frequency of both of these interactions (52% and 
30%, respectively). The reduced hydrogen bonding in the macroH2A nucleosome is likely a 
result of sequence deviations in the nearby H2A al helix (canonical: H2A.Z: 

macro: 

The dimer realignment also affects the hydrogen bonding between protein constituents 
in the histone core. In the canonical NCP, an average of 14.8 hydrogen bonds are formed 
between a single dimer and the tetramer, which is in agreement with the ~15 observed in 
the crystal structure. This increases to an average of 16.5 in the Ll-Mutant. The macroH2A 
and H2A.Z nucleosomes display an average of 14.7 and 14.4 hydrogen bonds, which are 
substantially more frequent than the ~8 observed in the initial conhgurations. Therefore, the 
variant dimer reorientation encourages the histones to form hydrogen bonds more frequently 
than in the crystal structures. 



Figure 4: Kullback-Leibler Divergence of dihedral angles for the (a) Ll-Mutant, (b) macroH2A, 
and (c) H2A.Z nucleosomes, using the canonical populations as a reference set. The dimers and 
DNA residues are represented in a rainbow spectrum, where divergence values increase from blue to 
red. The tetramer is shown in magenta, where the tube radius is wider for larger values. Significant 
divergences are observed both in the vicinity of and far from the Ll-Ll interface. 


Histone H2A LI Sequence Influences Dynamics Throughout the 
Nucleosome 

The LI loops not only influence dimer reorientation, but they also perturb the local dy¬ 
namics of residues throughout the nucleosome. Calculations of the Kullback-Leibler (KL) 
divergence between the canonical and variant systems showed the expected disparity in 
dihedral sampling of residues within the LI loop region (Figure 4). However, they also high¬ 
lighted signihcant changes in the dynamics of residues that are distant from these loops. 
In both the Ll-mutant and macroH2A systems, the dimer and tetramer constituents of the 
docking domains have statistically signihcant KL divergence values, indicating that their 
local dynamics are different in these systems relative to the canonical NCP. Although this 
was expected in the macroH2A and H2A.Z systems due to their sequence deviations, the 
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Figure 5: Average individ¬ 
ual residue correlation with LI 
loop residues for (a) canoni¬ 
cal and (b) L 1-Mutant nucleo- 
somes. Thicker, redder residues 
are those with stronger average 
correlations with the LI loop se¬ 
quence. The L 1-Mutant nucle- 
osome shows increased correla¬ 
tions near the Ll-Ll interface, as 
well as among H2B-H4 four helix 
bundle residues. Both systems 
display appreciable correlations 
between the LI loops and dock¬ 
ing domain residues in the dimer 
and tetramer. Histone tails were 
truncated to improve clarity. 


Ll-mutant is sequentially identical to the canonical system 
in these areas. Therefore, the observed difference in dy¬ 
namics must be due to allosteric networks that are shifted 
by the LI mutation. On the other hand, the dynamics in 
the histone core of H2A.Z only show small differences from 
the canonical system, most notably along the H2B ol2 helix, 
while the largest divergences are in the DNA. 

The Ll-mutant and macroH2A variants also have in¬ 
creased dynamical correlations between the H2A LI loops 
and key portions of the NCP (Table 2 and Figure 5). The 
strengthened interactions in the LI loops increase the Ll- 
Ll’ correlations from 0.42 in the canonical system to >0.67 
in each of the variants. In both the canonical and H2A.Z 
systems, the average correlation between the LI loops and 
either docking domain (symmetric - DD, opposing dimer - 
DD’) was 0.36-0.38. However, in the Ll-Mutant the average 
Ll-DD and Ll-DD’ correlations increased to 0.51 for both 
measurements, which were further increased to 0.56 and 0.60 
in the macroH2A structure. Although the variants had an 
increased correlation between the LI loops and DNA near 
the base of the nucleosome, the correlations between the LI 
loops and DNA extremities were similar in all four systems. 


Variant 

ways 


Presence Alters Allosteric Path- 


The origins of the altered dynamics and correlations in 
NCPs with variants were probed by computing the opti¬ 
mal and suboptimal correlation pathways using with the 
Weighted Implementation of Suboptimal Paths algorithm^®. 
Allosteric networks were calculated between the LI loops 
and the DNA entry and exit sites, and the tetramer compo¬ 
nents of the docking domains for each system. The results 
revealed that not only are there several networks of dynam¬ 
ically coupled residues in the canonical NCP, but that these 
networks are both modihed and strengthened by macroH2A, 
H2A.Z, and the Ll-mutant. The shifts are due to both 
changes in the NCP hydrogen bonding networks from subtle 
repositioning of the H2A histones, as well as increased interactions of the LI loops with one 
another and with the nucleosomal DNA. 

In the Ll-to-symmetric DNA end pathways, the canonical system utilizes three main 
routes for information transfer (Figure 6). In the hrst, networks primarily pass through 
neighboring H2B Ser^^-DNA and H2A oH helix Arg^®-DNA hydrogen bond interactions and 
into the DNA, whereas in the second the networks enter the DNA through the H2A Arg^^- 


Table 2: Average correlations between LI loops and relevant regions of the nucleosome core particle 
for each system. The Ll-Ll’ and Ll-DD(’) correlations are significantly stronger in the systems 
possessing the macroH2A LI loops, while LI correlations to the DNA extremities are unchanged. 
The associated docking domain is abbreviated as DD, and the docking domain of the opposing 
dimer is abbreviated as DD’. 


System 

Ll-Ll’ 

Ll-DD 

Ll-DD’ 

Ll-DNA 

Canonical NCP 

0.42 

0.38 

0.36 

0.49 

LI Mutant 

0.70 

0.51 

0.51 

0.48 

macroH2A NCP 

0.73 

0.56 

0.60 

0.48 

H2A.Z NCP 

0.67 

0.38 

0.38 

0.44 


DNA hydrogen bond near the intradimer interaction site. The third route for propagation 
extends along the H2A a2 helix, which passes dynamic information into the DNA basepairs 
via a Thr™-DNA interaction. The pathways of H2A.Z are similar to the canonical but with 
more pathways accessing the H2A od Arg^®-DNA interaction than that of H2B Ser^^-DNA. 
In the Ll-Mutant, the increased prevalence of the Arg^®-DNA hydrogen bond heavily biases 
information transfer through this network and increases the strength of this pathway. The 
decreased Arg^^-DNA interaction in the macroH2A nucleosome causes information to be 
transferred primarily via the H2A Arg^^-DNA hydrogen bond, with a significant number of 
pathways also traversing the H2A o(2 helix. 

The effects of Ll-Ll’ communication transfer are most apparent in the networks between 
an LI loop and the DNA end of the opposite symmetry. In the canonical nucleosome, there 
exist no pathways between LI loops, therefore networks must pass through indirect routes 
that include the DNA and histone tails. However, in all of the variant structures information 
is readily exchanged between the LI loops, allowing the pathways to immediately cross into 
the opposite symmetry dimer (Supplemental Figures S12-S15). Once information is passed 
into this dimer, it follows the typical pathways for Ll-to-symmetric DNA end propagation. 
This results in allosteric networks that are not only stronger, but more direct in the histone 
variants. 

Pathways between the LI loop and docking domains in the same dimer are similar in all 
systems, but there is a large disparity in the pathways between LI loops and the docking 
domain of the other dimer constituent. In the canonical NCP, the majority of paths pass 
from the LI loops through the H2B a2 helix into the tetramer portion of the docking domain 
via the four helix bundle of H2B-H4. The Ll-Mutant structure shows an increased number 
of contacts in this region, creating a more diverse set of pathways between bundle helices. 
The macroH2A nucleosome displays an alternate route in which pathways instead access the 
docking domain region via protein-DNA interactions. The H2A.Z system uniquely passes 
information along the H2B a2 helix of the opposing dimer. Pathways in H2A.Z also access 
the protein-DNA type route of macroH2A and the four helix bundle route of the canonical 
and Ll-Mutant systems, but at a reduced frequency. 
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(a) 





Distance of Path 


Figure 6: The 500 sub-optimal pathways between the LI loops and symmetrically associated 
DNA entry for the (a) canonical NCP, (b) Ll-Mutant, (c) macroH2A, and (d) H2A.Z projected 
on simulation snapshots. Also shown is the histogram of pathway distances (e). The LI and DNA 
sites are represented as blue spheres, and the pathways are outlined in red with the wider pathways 
representing those of shorter “distance.” Pathways in the variants are shorter, and thus stronger, 
than in the canonical NCP. 
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PTM Targets are Located at Allosteric Hotspots 

Beyond these specific pathways, dynamic networks exist throughout the NCP. To discern the 
importance of individual residues on these global networks, the edge-betweenness centrality 
of nucleosome residues was computed (Figure S8)."^^ In the canonical NCP, a majority of 
the optimal pathways rely heavily on the DNA basepairs and neighboring histone tail lysine 
and arginine residues to propagate communication throughout the system. In H2A.Z, an 
increased number of shortest pathways access the LI loops and the H4 a2 helix, but there 
remains a heavy reliance on the DNA and histone tails. In the Ll-Mutant and macroH2A 
nucleosomes, dynamic traffic to the four-helix bundle increases. Furthermore, pathways in 
these systems access LI residues more frequently than any other region. 

Interestingly, we find that residues with the highest edge-betweenness scores are more 
likely to be the sites of post-translational modifications (PTMs). Based on the distribution 
of centrality scores, we classify residues in the upper tenth percentile as “hotspots” for 
communication (see supplemental). A comparison of known PTM sites with these allosteric 
hotspots indicates that PTMs are enriched at these locations, with an enrichment factor 
of 254% (p-value of 0.0155). When we compare our “hot spot analysis” with known PTM 
sites, we observe a significant population of PTM targets (Figure 7). While PTMs in 
the histone core are identified more frequently than those in the tails, the most significant 
subset contains PTMs that have been implicated in affecting mononucleosome stabilities 
(“monoNCP PTMs”).^^ In relation to the types of PTMs, methylation sites are linked with 
allosteric hotspots more frequently than phosphorylations or acetylations, likely due to their 
presence at DNA entry/exit sites and between turns of superhelical DNA. 



Figure 7: (a) Receiver Operator Curves (ROC) for subsets of PTM sites in the canonical nucleo- 
some as identified by edge-betweenness centrality ranking. The largest enrichment can be seen in 
the PTM subset of monoNCP altering PTMs. The core PTMs are also more frequently identified 
than the tails, (b) ROC for monoNCP altering PTMs across the variant systems. The canonical 
and H2A.Z systems are shown to depend greater on monoNCP altering PTMs than the Ll-Mutant 
and macroH2A systems for distributing dynamic information. 

An overall correlation between allosteric hotspots and PTM locations is maintained in 
the nucleosome variants, however the specific details differ between the systems (Table S2). 
For example, all four systems show the importance of PTM sites in the H3 histone near 


11 





























the DNA extremities, while histone H4 monoNCP PTM sites in the four-helix bundle are 
accessed more frequently in variant networks. In general, the canonical system displays the 
greatest reliance on monoNCP PTMs, then the H2A.Z nucleosome, and hnally the Ll-Mutant 
and macroH2A systems, respectively. 

Structural Stability in Variant Nucleosomes 

To quantify global NCP dynamics, a full correlation 
analysis (PCA) was performed on the atoms of the 
histone core a-helices.®"^ Two of the dominant motions 
identihed in this analysis corresponded to the nucle¬ 
osome opening motions described by Bbhm et 
Projections into the phase-space described by these 
and other FCA modes showed that all nucleosome 
systems had similar global dynamics on the hundreds 
of nanoseconds timescale (see Supplemental Figure 
S6). However, given that the dynamics of nucleosome 
opening likely occur on the millisecond timescale, our 
simulations are far too short to effectively explore the 
effects of histone variants on large-scale NCP dynam¬ 
ics. 

In contrast to the nucleosome opening motions, 
the DNA end-to-end separation distance does depend 
on the identity of the H2A histone (Figure 8). The 
sampling in the canonical system can be divided into 
two states: the prominent “compact” state centered 
around 67 A and the “open” state centered at 71 
A. The Ll-mutant sampled both the open and closed 
states, however the percentage of time spent in the 
open state was reduced from 14% to 11% of the sim¬ 
ulation. Both the macroH2A and H2A.Z systems only sampled the closed state. 

Results of an MM/GBSA analysis indicate that the overall DNA binding energetics are 
also altered by H2A variants. The DNA binding affinities to the Ll-Mutant and macroH2A 
octamers were 31.0 ± 9.1 kcal/mol and 5.7 ± 9.7 more favorable than binding to the canonical 
NCP (Table 3). There are two primary contributors to this shift: direct interactions with the 
LI loops (AAGli) and the changes in the DNA conhguration (AAGdna)- In fhe Ll-mutant 
and macroH2A systems, AAGli was largely a result of removing the negatively charged Glu"^^ 
from the canonical loop and introduction of the Lys^'^-DNA interaction, which combine for 
an increase in binding free energy on the order of 10 kcal/mol. The reorientation of the 
dimers in the macroH2A-hke systems also influences a favorable shift in DNA conformation 
relative to the canonical system (AAGdna = -9-3 ± 5.6 kcal/mol and -17.0 ± 1.8 kcal/mol 
for the Ll-Mutant and macroH2A nucleosomes, respectively). However, in macroH2A a 
number of small shifts in the remainder of the NCP oppose binding and therefore make it 
more comparable to the canonical system, which does not occur in the Ll-mutant NCP. The 
H2A.Z nucleosome does not exhibit the same favor ability for DNA binding when compared 



Distance (A) 

Figure 8: Distance populations for 
DNA end-to-end spread. The canoni¬ 
cal system exists in two states: one cen¬ 
tered around 67 A (“compact”) and one 
centered around 71 A (“open”). While 
the Ll-Mutant samples both states, the 
amount of time spent in the open state 
is drastically reduced. The macroH2A 
and H2A.Z nucleosomes exist only in 
the compact state. Fits are represented 
in dotted lines. 
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Table 3: MM/GBSA calculated binding energies for DNA binding affinity to the histone core in 
each of the NCP systems. AAG’s are referenced against the canonical NCP. The LI loop sequence 
and DNA conformations of the variant structures contribute significantly toward favorable binding 
of DNA in the macroH2A-like systems, relative the canonical NCP. All values are reported in 
kcal/mol. 


System 

^Gbinding 

binding 

AAGli 

AAGdna 

AGassembiy 

A AGassembiy 

Canonical NCP 

-428.6 ± 5.6 




-618.6 ± 5.8 


LI Mutant 

-459.6 ± 7.2 

-31.0 ± 9.1 

-8.7 ± 0.1 

-9.3 ± 5.6 

-679.6 ± 7.5 

-60.0 ± 9.5 

macroH2A NCP 

-434.5 ± 7.9 

-5.7 ± 9.7 

-10.7 ± 0.1 

-17.0 ± 1.8 

-637.1 ± 7.9 

-18.5 ± 9.8 

H2A.Z NCP 

-423.0 ± 6.9 

5.6 ± 8.9 

-5.8 ± 0.1 

5.8 ± 5.4 

-616.3 ± 6.9 

2.3 ± 9.0 


to the canonical octamer, but instead shows a disfavoring shift of 5.6 ± 8.9 kcal/mol. The 
removal of the negative charge on Glu'^^ creates a favorable shift of 5.8 ± 0.1 kcal/mol 
in AAGli, but this is balanced by the nearly identical free energy penalty in the DNA 
rearrangement term AAGdna- 

Similarly, another MM/GBSA analysis revealed that macroH2A variants modify the en¬ 
ergetics of complex assembly. The AGassembiy of the Ll-Mutant and macroH2A nucleosomes 
were -60.0 ± 9.5 and -18.5 ± 9.8 kcal/mol more favorable than the canonical system. The 
favorability in the macroH2A-like systems is a result of favorable DNA binding coupled with 
stronger protein-protein interactions. The AGassembiy of H2A.Z was in agreement with that 
of the canonical nucleosome (Table 3). 


Discussion 

The simulations and analysis presented here detail a series of mechanisms by which the 
histone variants macroH2A and H2A.Z influence the dynamics of the nucleosome core par¬ 
ticle. The subtle structural rearrangements these variants cause leverage the tightly packed 
nature of the histone core to influence the global energetics and dynamics of the complex, 
thus influencing gene expression. Dynamic effects appear to be particularly important, as 
they allow for the propagation of information through allosteric networks that span large 
distances. Although our simulations are only able to probe the sub-ps timescale, the dy¬ 
namic differences observed at the dimer-tetramer and DNA/histone interfaces will likely be 
amplihed on the ms timescale and result in these variants having altered nucleosome opening 
and DNA breathing motions. 

These results also offer new insights into biochemical experiments that probed the mech¬ 
anism of macroH2A. For example, Nusinow et al. showed that the Ll-mutant is enriched in 
the inactive female X chromosome at nearly the same rate as the complete histone-domain of 
macroH2A. Point mutations demonstrated that enrichment was signihcantly increased by 
the two mutations that introduce additional bulk into the LI loops, N38H and E41Y, whereas 
it was decreased by the Y39P mutation, which decreases the size of the LI loop. Based on 
our results, we believe that larger sidechains may help encourage the a2 “bulging” motion 
observed in each of the variant simulations, and therefore make the NGP more variant-like. 

In another set of experiments, Ghakravarthy et al. demonstrated that mutations to the 
LI loops modulate the salt-dependent stability of the histone octamer. They showed that 
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in both the Ll-mutant and macroH2A-containing system, the histone octamer is stable down 
to 0.5 M NaCl, whereas the canonical and H2A.Z-containing structures dissociate into dimer 
and tetramer constituents in solutions below 1.1 M. In agreement with this, we observe a 
signihcantly more favorable interaction between the H2A LI loops in the variant structures 
than in the canonical structure. Since the Ll-Ll interface is the only location of dimer-dimer 
interaction, stability in this region translates to octamer stability. 

The mechanisms of H2A.Z remain more elusive. Stability studies have been non-conclusive 
as some indicate that H2A.Z enhances stability,®® while others suggest that it destabilizes the 
nucleosome. Our simulations show H2A.Z nucleosome stabilities that are in agreement with 
the canonical system, despite their differing dynamics. These systems were constructed with 
identical sequences, except for H2A composition. Therefore, our hndings support a mecha¬ 
nism which suggests that H2A.Z by itself has little-to-no effect on NCP stability. Instead, 
H2A.Z presence may be combined with other factors - such as PTMs or H3 variant presence 
- in order to alter particle stability. ^^’®® Furthermore, the altered dynamics and locations of 
allosteric networks and hotspots between H2A.Z and canonical nucleosomes may result in 
different responses to these chromatin remodeling factors. The dimer reordering may also 
act to recruit transcriptional machinery to chromatin possessing large populations of H2A.Z, 
such as transcriptional starting sites. 

Finally, we present a comprehensive analysis of the dynamic networks in the nucleosome. 
We observe that these networks are strongly affected by the dynamics of the LI loops which 
are allosterically linked to a wide number of important regions in the nucleosome core. 
Using only small changes in their structure, variants are able to modify these networks to 
affect the function of the NCP. We hypothesize that this is a general mechanism that other 
chromatin remodeling factors may also utilize. For example, the hnding that PTMs are 
enriched at residues with increased allosteric activity suggests that these perturbations may 
take advantage of dynamic networks to amplify their effects on chromatin and influence 
global NCP dynamics. In addition, by altering these networks, variants may be able to tune 
the responses of nucleosomes to specihc PTMs. Future work to study the disparate effects of 
chromatin remodeling factors on dynamics in the nucleosome is required to fully understand 
the mechanisms of in vivo gene expression and regulation. 


Supplementary Information 

Complete methods; Tables S1-S2; Figures S1-S23. 
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Methods 

System Construction and Simulation Details 

The canonical nucleosome was initiated from the crystal structure of Daveys et al. (PDB 
ID: 1KX5).^^ The crystallographic Mn^+ were replaced by physiological Mg^+. Additional 
Mg^+ ions were added to hll symmetrically suggested voids. The crystallographic waters 
were also maintained. The Ll-Mutant structure was then created from the canonical one by 
mutating the H2A LI loops to sequence of macroH2A. The mutation 

was done using VMD. The macroH2A system was initialized from the crystal structure solved 
by Chakravarthy et al. (PDB ID: 1U35).®^ The missing tail segments were constructed using 
the canonical structure as a reference. Strong similarities in DNA arrangement - particularly 
at DNA-protein binding sites (Figure SI) - allowed for the 146 basepairs of DNA from the 
crystal structure to be replaced by the 147 bp (plus Mg^+) of the 1KX5 structure, and 
H3 residues were mutated to match the sequence of the 1KX5 system. These actions were 
taken to ensure that differences between the systems were attributable only to H2A sequence 
divergence. The H2A.Z system was constructed analogously, using the crystal structure of 

*To whom correspondence should be addressed 
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Suto et al. (PDB ID: 1F66).®^ Therefore, each system was composed of 147 palindromic 
basepairs of a-satellite DNA wrapped around a histone core of Xenopus laevis H3, H4, and 
H2B with human H2A histones and variants. Histidine states were assigned using PROPKA 
and the interactive H-Bond Optimizer of the Desmond-Schrddinger package.®^ 

Each system was simulated three times. Each simulation underwent 10,000 steps of 
geometric minimization (5,000 steps with protein heavy atoms harmonically restrained by 
a force constant of 10 kcal/mol/A^ and 5,000 steps without restraints). Heating was done 
by gradually raising the temperature from 10 to 300 K over 6 ps of simulation in the NVT 
ensemble. During heating, protein heavy atoms were harmonically restrained with a force 
constant of 10 kcal/mol/A^. The restraints were then gradually released over GOOps in the 
NPT ensemble.®^ Each simulation was then conducted for an additional 250ns in the NPT 
ensemble using a Langevin piston with a 100 fs period and collision frequency of 3 ps"^. 
The SHAKE algorithm was used to allow for a 2 fs timestep, and long-range electrostatics 
were calculated using the particle mesh Ewald method. Short range interactions were 
calculated with a 10 A cutoff, where a switching function was applied at 8 A. It was observed 
that '^50ns was required for system equilibration, and so ~200ns of production data was 
obtained from each simulation (600ns per system). 

Allosteric Calculations 

Residue correlations were calculated using the “largest linear mutual information” method. 

In this method, the linear mutual information is calculated between all heavy atoms in the 
system. The residue-wise mutual information values were converted to a Pearson correla¬ 
tion coefficient-like value by = [1 — where 7*^ is the largest linear mutual 

information between any two atoms of the residues i and j. 

Contact maps were produced in-house using the MDAnalysis package. Two residues 
were considered to be in contact if their Ca (protein) or Cl’ (nucleic) atoms were within 
10 A for 70% of the conhgurations. Using the predehned correlation matrices and this 
contact map, a NetworkX edgelist was formed. The length of each edge was dehned by 
Di^j = —log{rij), where rij is the correlation value between residues i and j. The optimal 
paths were calculated using the NetworkX implementation of Dijkstra’s algorithm, and the 
suboptimal paths were determined using Yen’s K-Shortest Paths algorithm. 

The Kullbach-Leibler Divergence of dihedral angles were calculated using the method of 
McClendon. In this method, each simulation was separated into three blocks (9 blocks per 
system) with 31,666 conhgurations (roughly 63.3 ns) per block. Histogram widths were 15 
degrees. The Kullbach-Leibler Divergence values were calculated for each variant by using 
the populations of the canonical system as a reference set. Bootstrapping techniques were 
employed to calculate the self-divergence of the canonical system. For any residue in a variant 
system whose divergence value was below this self-divergence threshold, the KL-Divergence 
value for that residue was set to 0. 

Edge-betweenness Centrality 

The importance of a node in a communication network can be dehned by its edge-betweenness 
centrality. In this method, the “shortest” correlation pathway between all residue-pairs is 
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calculated. A residue’s edge-betweenness centrality is then defined as the number of shortest 
paths in which the residue appears: 


^ ( 1 ) 

s^t^i 

where N is the total number of paths and Xi(s,t) is either 0 (residue i does not exist in 
path between residues s and t) or 1 (residue i does exist in said path). For visualization 
purposes, centrality values are normalized such that the minimum centrality is 0 and the 
maximum centrality is 1, according to the formula: 

^ ~ Cmin 

'-'norm ^ ^ 


where Cmax and Cmin are the maximum and minimum centrality values in the network. 

From the plot of normalized centrality value vs percentile (Figure S9), we observe that 
the difference in centrality between percentile increments is not constant but is large at the 
upper and lower quartiles and steady in the interquartile region. The inflection point of the 
upper quartile exists near the tenth percentile, so we have chosen this location as our cut-off 
for defining allosteric “hotspots.” 

We were able to identify several well-known post-translational modifications (PTMs) in 
the canonical nucleosome by our betweenness centrality measurement. In the canonical 

system, we observe 6 of 23 mononucleosome altering PTM sites (monoNCP PTMs) in the 
upper tenth percentile and 12 of 23 in the upper quartile. The significance of observing this 
subset of residues in each percentile was tested by calculating the pmf of a hypergeometric 
distribution,®^'^ 


pmf [x = k) 



( 3 ) 


where N is the total number of protein residues (487), n is the percentile population size 
(n=49 for the upper tenth, and n=122 for the upper quartile), K is the total number of 
monoNCP PTM residues (23), and k is the number of observed monoNCP PTM sites (k=6 
for the upper tenth, and k=12 for the upper quartile). Using these values, the upper tenth 
percentile observation has a p-value of 0.0155, and the upper quartile p-value is 0.00288. 
Therefore, the observation that monoNCP PTMs are located at allosteric hotspots is statis¬ 
tically significant. 

Furthermore, we can quantify the presence of monoNCP PTMs at allosteric hotspots by 
calculating the enrichment factor (EF) of monoNCP PTMs over random selection. 


EF 


k N 


( 4 ) 


where the variables have the same meaning as for the hypergeometric distribution. We then 
calculate an EF of 2.54 for monoNCP PTM presence at allosteric hotspots. A plot of EF vs 
centrality percentile can be found in Figure Sll. 
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Figure SI: Comparison of crystallographic DNA arrangment in the canonical (blue), macroH2A 
(red), and H2A.Z (grey) nucleosomes. As expected, the DNA-histone binding sites show strong 
agreement in coordination between the three structures. 


Table SI: Hydrogen bonding at key locations in the nucleosome. For the H-Bonds formed with 
the DNA, the occupancy of each bond is given. For the other interactions, the average number of 
hydrogen bonds between each group in a given frame is listed. 


System 

H2A R^^-DNA 

H2B S^NdnA 

Ll-Ll 

Dimer-tetramer 

Canonical NCP 

63% 

40% 

0.5 ± 0.1 

14.8 ± 1.1 

LI Mutant 

70% 

60% 

0.4 ± 0.1 

16.5 ± 1.0 

macroH2A NCP 

52% 

30% 

0.2 ± 0.1 

14.7 ± 0.8 

H2A.Z NCP 

73% 

85% 

0.1 ± 0.1 

14.4 ± 1.1 
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Figure S2: Backbone RMSD of three canonical simulations. The simulations were fit to the histone 
core backbone, and the RMSD calculations were done on the DNA and histone core backbone atoms, 
excluding tail residues. 
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Figure S3: Backbone RMSD of three Ll-Mutant simulations. The simulations were fit to the his¬ 
tone core backbone, and the RMSD calculations were done on the DNA and histone core backbone 
atoms, excluding tail residues. 
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Figure S4: Backbone RMSD of three macroH2A simulations. The simulations were fit to the his¬ 
tone core backbone, and the RMSD calculations were done on the DNA and histone core backbone 
atoms, excluding tail residues. 



Figure S5: Backbone RMSD of three H2A.Z simulations. The simulations were ht to the histone 
core backbone, and the RMSD calculations were done on the DNA and histone core backbone 
atoms, excluding tail residues. 
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Figure S6: Dimer dissociation phase space for the nucleosome systems. All systems sample 
the same region of this space, which suggests that H2A composition has little effect on dimer 
dissociation in the hundreds of nanoseconds timescale. For reference, the dimer separation event 
occurs on the millisecond timescale. 


• Canonical 

• Ll-Mutant 

• macroH2A 
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Figure S7: Pearsified Largest Linear Mutual Information matrices for (a) canonical, (b) Ll- 
Mutant, (c) macroH2A, and (d) H2A.Z nucleosomes. The canonical NCP shows the weakest 
average correlation across the whole molecule, and the macroH2A variant shows the strongest. 
The Ll-Mutant correlation strengths are similar to macroH2A, while the H2A.Z nucleosome shows 
correlations that are slightly above the levels of the canonical nucleosome. The steady increase 
in correlations within the variant systems is likely a result of favorable changes in interhistone 
interactions. 
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Figure S8: Edge-betweenness centrality for (a) canonical, (b) Ll-Mutant, (c) H2A.Z, and (d) 
macroH2A nucleosomes. Brighter, wider regions represent locations that are accessed more fre¬ 
quently in the optimal communication pathways of each nucleosome system. The H2A Ll-Ll 
interaction region in the Ll-Mutant and macroH2A systems act as communication hubs for al¬ 
losteric networks in the nucleosome, whereas the canonical and H2A.Z nucleosomes rely heavier on 
DNA to propagate information. 
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Figure S9: Centrality scores vs percentile ranking (blue dots) for the canonical nucleosome. The 
spline fit is represented in red. The drastic change in centrality score in the upper quartile indicates 
that residues rely heavily on several key residues for information propagation. The inflection point 
of this trend is located at the tenth percentile. 



Figure SIO: ROC plot for the canonical nucleosome with PTMs separated by modification type. 
Methylations exist most prevalently as allosteric hotspots, and acetylations are the least prevalent. 
The early enrichment of methylations is a result of their presence near DNA extremities and between 
superhelical DNA turns where pathways cross the symmetries. 
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Figure Sll: Enrichment Factor (blue) and monoNCP PTM identification percent (red) as func¬ 
tions of centrality percentile in the canonical nucleosome. We observe a strong degree of early 
enrichment for identifying monoNCP PTMs at allosteric hotspots. At the cutoff of the tenth 
percentile, we observe an EF of 2.54. 


Table S2: Protein-normalized centrality values for the 23 monoNCP PTM targets. The percentile 
rank of each value is also listed. The residues in the upper quartile are listed in bold. Those in the 
upper tenth percentile are italicized. PTM sites that have significantly smaller centrality values 
than neighboring sequence residues in the upper quartile are labeled by an (*) and the value of the 
neighbor is reported. 



Canonical 


Ll-Mutant 

macroH2A 

H2A.Z 


PTM 

Centrality Percentile 

Centrality 

Percentile 

Centrality 

Percentile 

Centrality Percentile 

H3 K4me3 

0.05 

6.6 

0.07 

6.3 

0.05 

6.2 

0.06 

6.7 

H3 K9ac 

0.13 

27.0 

0.19 

22.6 

0.14 

24.8 

0.16 

27.5 

H3 K14ac 

0.20 

75.6 

0.30 

51.0 

0.23 

53.4 

0.26 

61.2 

H3 KlSac 

0.27 

87.9 

0.39 

73.9 

0.30 

69.8 

0.34 

78.4 

H3 K23ac 

0.34 

92.8 

0.51 

87.3 

0.38 

82.0 

0.44 

88.9 

H3 K36me2,3 

0.47 

97.5 

0.80 

97.3 

0.45 

86.1 

0.69 

97.7 

H3 Y41ph 

0.46 

96.9 

1.00 

100.0 

0.67 

96.9 

0.84 

99.8 

H3 R42me2a 

0.51 

98.4 

0.79 

96.7 

0.37 

81.5 

0.79 

99.2 

H3 T45ph* 

0.29 

75.0 

0.38 

71.1 

0.22 

52.4 

0.28 

67.0 

H3 K56ac 

0.17 

58.9 

0.18 

20.1 

0.22 

52.4 

0.19 

35.6 

H3 S57ph 

0.15 

39.0 

0.16 

17.9 

0.22 

52.4 

0.16 

28.6 

H3 K64ac 

0.15 

42.7 

0.26 

38.0 

0.27 

63.6 

0.21 

46.1 

H3 K115ac 

0.15 

38.4 

0.13 

11.7 

0.06 

8.5 

0.24 

53.9 

H3 TllSph 

0.37 

93.4 

0.43 

81.9 

0.14 

25.8 

0.63 

96.5 

H3 K122ac 

0.23 

81.9 

0.29 

47.4 

0.13 

23.9 

0.39 

82.7 

H4 K16ac 

0.21 

77.0 

0.38 

70.0 

0.16 

33.9 

0.18 

34.2 

H4 S47ph* 

0.25 

85.4 

0.28 

43.9 

0.19 

44.3 

0.34 

78.2 

H4 K77ac 

0.18 

61.8 

0.29 

45.2 

0.15 

28.3 

0.25 

58.0 

H4 K79ac* 

0.31 

91.4 

0.34 

63.8 

0.56 

95.0 

0.35 

79.8 

H4 K91ac 

0.12 

23.4 

0.38 

70.0 

0.57 

95.4 

0.29 

68.9 

H4 R92me 

0.21 

76.4 

0.53 

88.5 

0.81 

99.4 

0.33 

77.6 

H2B K30ar 

0.94 

99.6 

0.81 

97.7 

0.26 

62.4 

0.72 

98.8 

H2B K123ubl 

0.00 

0.0 

0.00 

0.0 

0.00 

0.0 

0.00 

0.0 
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Figure S13: Sub-optimal pathways for LI loop to DNA exit in the Ll-Mutant nucleosome. 
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Figure S14: Sub-optimal pathways for LI loop to DNA exit in the macroH2A nucleosome. 
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Figure S15: Sub-optimal pathways for LI loop to DNA exit in the H2A.Z nucleosome. 
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Figure S16: Sub-optimal pathways for LI loop to the associated docking domain in the canonical 
nucleosome. The opposing dimer has been removed to improve visualization clarity. 
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Figure SI7: Sub-optimal pathways for LI loop to the associated docking domain in the L 1-Mutant 
nucleosome. The opposing dimer has been removed to improve visualization clarity. 


S19 



Figure S18: Sub-optimal pathways for LI loop to the associated docking domain in the macroH2A 
nucleosome. The opposing dimer has been removed to improve visualization clarity 
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Figure S19: Sub-optimal pathways for LI loop to the associated docking domain in the H2A.Z 
nucleosome. The opposing dimer has been removed to improve visualization clarity. 
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Figure S20: Sub-optimal pathways for LI loop to non-associated docking domain in the canonical 
nucleosome. The opposing dimer has been removed to improve visualization clarity. 
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Figure S21: Sub-optimal pathways for LI loop to non-associated docking domain in the Ll- 
Mutant nucleosome. The opposing dimer has been removed to improve visualization clarity. 
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Figure S22: Sub-optimal pathways for LI loop to non-associated docking domain in the macroH2A 
nucleosome. The opposing dimer has been removed to improve visualization clarity. 
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Figure S23: Sub-optimal pathways for LI loop to non-associated docking domain in the H2A.Z 
nucleosome. The opposing dimer has been reduced to only H2B a2 to improve visualization clarity. 
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